Understanding application-level interoperability: Scaling-out MapReduce over high-performance grids and clouds
Abstract
Application-level interoperability is defined as the ability of an application to utilize multiple distributed heterogeneous resources. Such interoperability is becoming increasingly important as data volumes grow and as the number of data sources and resource types increases. The primary aim of this paper is to understand the different ways and levels at which application-level interoperability can be provided across distributed infrastructure. Our approach is as follows: (i) given the simplicity of MapReduce, its widespread usage, and its ability to capture the primary challenges of developing distributed applications, we use MapReduce as the underlying exemplar and develop an interoperable implementation of it using SAGA, an API to support distributed programming; (ii) using the canonical wordcount application built on SAGA-based MapReduce, we investigate its scale-out across clusters, clouds and HPC resources; (iii) we establish the concurrent execution of the wordcount application using MapReduce and other programming models such as Sphere. In addition to being interoperable across different distributed infrastructures, SAGA-based MapReduce provides user-level control of the relative placement of compute and data. We provide performance measures and analysis of SAGA-MapReduce when multiple, different, heterogeneous infrastructures are used concurrently for the same problem instance.
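For readers unfamiliar with the wordcount exemplar mentioned in the abstract, the following is a minimal single-process sketch of the map, shuffle and reduce steps in plain Python. It is an illustration only: it does not use SAGA, Sphere, or any distributed back end, and the function names (`map_phase`, `shuffle`, `reduce_phase`, `wordcount`) are hypothetical and not taken from the paper.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(chunk: str) -> Iterator[Tuple[str, int]]:
    """Emit a (word, 1) pair for every whitespace-separated token in a chunk."""
    for word in chunk.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Group the emitted values by key, as the framework does between phases."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(word: str, counts: List[int]) -> Tuple[str, int]:
    """Sum the partial counts collected for one word."""
    return word, sum(counts)


def wordcount(chunks: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle and reduce sequentially over all input chunks."""
    pairs = (pair for chunk in chunks for pair in map_phase(chunk))
    return dict(reduce_phase(w, c) for w, c in shuffle(pairs).items())


if __name__ == "__main__":
    # Each string stands in for a data chunk that a real deployment
    # might place on a different resource (assumption for illustration).
    print(wordcount(["grids and clouds", "clouds and clusters"]))
```

In a distributed setting, each call to `map_phase` could run on a different resource and only the shuffle step would need to move data between them; how that placement is controlled is the subject of the paper and is not modelled here.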
Similar resources
Programming Abstractions for Clouds
Clouds seem like ’Grids Done Right’, including scalability, transparency, and ease of management. Virtual Machines are the dominant application environments for compute Clouds, however, that does not make application programming any less relevant than “non-virtualized” environments. The limited set of successful Cloud applications show that distributed programming patterns of the type of MapRed...
A Quick Introduction to Clouds
There is not yet a standard definition for cloud computing, but a good working definition is to say that clouds provide on demand resources or services over the Internet, usually at the scale and with the reliability of a data center. There are at least two different, but related, types of clouds: the first category of clouds is designed to provide computing instances on demand, while the secon...
Hybrid Computational Infrastructure Supporting eResearch
Important developments – the data deluge, Cloud computing, multicore architectures and growing importance of lightweight clients (tablets and smartphones) – are changing the Cyberinfrastructure (eInfrastructure) supporting eResearch. Many of the detailed features of Grids seem unlikely to survive with clouds replacing many aspects of compute grids. On the other hand, supercomputers and clusters s...
Performance Provisioning and Energy Efficiency in Cloud and Distributed Computing Systems
In recent years, the issue of energy consumption in high performance computing (HPC) systems has attracted a great deal of attention. In response to this, many energy-aware algorithms have been developed in different layers of HPC systems, including the hardware layer, service layer and system layer. These algorithms are of two types: first, algorithms which directly try to improve the energy b...
CometCloud: An Autonomic Cloud Engine
Clouds typically have highly dynamic demands for resources with highly heterogeneous and dynamic workloads. For example, the workloads associated with the application can be quite dynamic, both in terms of the number of tasks processed as well as computation requirements of each task. Furthermore, different applications may have very different and dynamic QoS requirements, for ...
Journal:
- Future Generation Comp. Syst.
Volume 27 Issue
Pages -
Publication date 2011